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Most methods proposed to uncover communities in complex networks rely on their struc- 
tural properties. Here we introduce the stability of a network partition, a measure of its 
quality defined in terms of the statistical properties of a dynamical process taking place 
on the graph. The time-scale of the process acts as an intrinsic parameter that uncov- 
ers community structures at different resolutions. The stability extends and unifies stan- 
dard notions for community detection: modularity and spectral partitioning can be seen 
as limiting cases of our dynamic measure. Similarly, recently proposed multi-resolution 
methods correspond to linearisations of the stability at short times. The connection be- 
tween community detection and Laplacian dynamics enables us to establish dynamically 
motivated stability measures linked to distinct null models. We apply our method to find 
multi-scale partitions for different networks and show that the stability can be computed 
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efficiently for large networks with extended versions of current algorithms. 

The relation between the structure of a network and the dynamics that takes place on it 
has been studied extensively in the last years^^. A growing body of research has shown how 
processes such as diffusion and synchronisation are affected by the underlying graph topology 
and, conversely, how such dynamical processes can be used to extract information about the 
network and reveal its structural properties. This two-way relationship is particularly relevant 
when the network is composed of tightly-knit modules or communities^'^'^^^^. On the one 
hand, it has been observed numerically that the presence of communities enhances partially 
coherent dynamics in networks^, and has a direct effect on the emergence of co-operation— 
and the coexistence of heterogeneous ideas in a social network—. On the other hand, dynamical 
processes such as random walks— and synchronization— have been proposed as empirical 
means to uncover the community structure of networks. 

In parallel to these dynamical studies, there has been extensive research on community de- 
tection based on structural properties of graphs Several methods have been proposed in 
order to partition a network into communities, as a way to coarse-grain the level of description 
of the system but also to identify underlying, often unknown, functionalities or relationships 
between the nodes^. At the heart of these methods, there is always a definition for what is 
thought to be the goodness of a partition. In the last decades, a variety of quality functions have 
been proposed, such as cut, normalized cut, modularity and extensions of modularity, together 
with heuristics in order find the partition optimizing the particular quality function. These defi- 
nitions have in common to be combinatorial, in the sense that the quality of a partition is always 
measured by counting the number of links within/between the communities. A good example 
is the widely-used modularity-^^ of a partition V, which measures if there are more edges 
within communities than would be expected on the basis of chance. In the case of undirected 
and unweighted networks, modularity reads 




(1) 
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Here, A is the adjacency matrix, where Aij is equal to 1 if there is a link between node i and 
node j, and zero otherwise; the degree of node i is defined as ki = J2j ^ij'^ = J2i j ^ijf^ 

is the total number of links in the network. The summation is performed over all pairs of nodes 
belonging to the same community C of the partition V. Effectively, modularity therefore counts 
the number of links within communities and compares it to the expected number of such links in 
an equivalent null model. The standard choices for _Py are the uniform model Pij = {k)'^/2 and 
the Newman-Girvan model Pij = kikj/2m, which is closely related to the configuration model 
and is often preferred because it takes into account the degree heterogeneity of the network—. 
Despite some limitations, modularity Q has revealed a crucial quantity which is now at the heart 
of several community detection methods, and has been generalised to weighted and/or directed 
networks (see Supplementary Information). 



Stability of a partition 

The purpose of this paper is to clarify the relationship between dynamics and structure by 
proposing a general framework in order to measure the quality of a partition in terms of the 
properties of a stochastic process applied on the graph. To do so, we generalize a recent ap- 
proach where the quality of the partition of a graph is expressed in terms of its stability, an 
autocovariance function of a Markov process on the network—. Without loss of generality, let 
us describe some Markov process A4 in terms of the motion of a random walker moving on the 
graph. Under the single condition that Ai is ergodic, namely that any initial configuration will 
asymptotically reach the same stationary solution, the stability of the partition V is defined as 

RM{t) = Y,PiC,t) -P{C, oo) (2) 
cev 

where P{C, t) is the probability for a walker to be in the same community C initially and at 
times t when the system is at stationarity. Because the dynamics is ergodic, the memory of 
the initial condition is lost at infinity, and P{C, oo) is therefore also the probability for two 
independent walkers to be in C. Stability measures the quality of a partition in terms of the 
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persistence of the dynamics by giving a positive contribution to communities from which a 
random walker is unlikely to escape within the given time scale t. 

Stability differs from modularity in several aspects that we will develop throughout this 
paper. First, stability is based on flows of probability on the graph and therefore captures how 
the global structure of the system constrain patterns of flows, while modularity focuses on 
pairwise interactions and is blind to such patterns, thereby neglecting important aspects of the 
network architecture'"^. Second, stability describes the quality of the partition of a graph at 
different time scales and this quantity is, in general, optimised by different partitions at different 
times. The measure RMi^) can thus be used as an objective function to be maximised at each 
time, thereby leading to a sequence of optimal partitions at different times. We will show below 
that time is a resolution parameter that allows to unravel the structure of the network at different 
scales, and that several heuristics measures correspond to the stability of some process at a 
certain time scale. Finally, our work shows that different Markov processes applied on the same 
graph lead to different versions of stability, thereby leading to different optimal partitions. This 
result underlines that the definition for a good partition into communities should depend on the 
nature of the network and of the dynamics taking place on it. The flexibility in the definition of 
RM{t) has therefore the advantage to allow a user to chose a particular process, e.g. modelling 
how information or energy flows on the graph, in order to define an adapted quality function. 

Undirected vs directed networks 

In order to clarify the above concepts, let us first focus on the discrete-time dynamics of an 
unbiased random walker, which is a prototype model for diffusion of information. In general, 
the density of random walkers on node i at step n, denoted by Pi-n, evolves according to: 



of node j. The right-hand side accounts for the walkers situated at j at time n and arriving at 
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where Aij is now the weight of the link going from j to i, and k° 



Aij is the out- strength 
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i by following a link going from j and i. In this process, the probability of a walker to move 
between two nodes is proportional to the weight of the link. 

For a generic undirected networlsQ, for which the adjacency matrix is symmetric Aij = Aji 
(and therefore = /c*" = ki), it is known that the stationary solution of the dynamics is given 
by p* = ki/2m, where m is now the total weight in the network. For such a process, the prob- 
ability that a walker is in a community C at stationarity is J2jec kj/2m while the probability 
of a random walker to be in C during two successive time steps is Xli jec(^*i/^i)(^i/2"^)- It 
follows that stability at time 1 is given by 



C ijGC 



kj 2m 2m 2m 



Qconli (4) 



which is equal to the (configuration) modularity. Interestingly, the configuration null model 
appears naturally from the properties of the random walk, and not as an additional choice, as in 
the static definition (fT6l ). This Markov viewpoint therefore provides a dynamical interpretation 
of modularity, which can be seen as the special case (based on paths of length one) of the 
stability Rnwii) of the partition, a more general quality measure which is now a function of 
time^. 

This connection between modularity and stability is only valid in the case of undirected 



networks. To show so, let us now focus on the same random process (1231) applied to a directed 
network. We further assume that the network is strongly connectej^ in order to ensure that the 
dynamics is ergodic. In that case, the stationary density of walkers is given by the (normalized) 

A- ■ 

dominant eigenvector tt of the transfer matrix -j^. By repeating the above steps, one finds that 



stability at t = 1 is given hy 

A, 



Uout J ' J 



(5) 



'We assume the system to be that ergodic, i.e., the network is connected and non-bipartite, in order to avoid 

unnecessary technical complications. 

^If this is not the case, it is always possible to make the dynamics ergodic by introducing "a la Google" random 

teleportations^. 

^The quality function (|5]l has been proposed independently during the revision of this paper by Kim et al.— 
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which is conceptually different from the combinatorial definition (fT6l ). and from its generaliza- 
tion for directed networks. Indeed, ([5]) favours partitions with long persistent flows of probabil- 
ity within modules, while modularity favours instead partitions with high densities of links (see 
Supplementary Information). In the case of undirected networks, these two aspects reconcile 
but they may lead to different and complementary ways to analyse the same graph when the 
links are directed. 



Time as a resolution parameter 

Further insight into our dynamical interpretation can be gained from continuous-time processes 
associated with the random walk (|23l ). If we assume that there are independent, identical ho- 
mogeneous Poisson processes defined on each node on the graph, such that the walkers jump at 
a constant rate from each node, the corresponding continuous-time process is governed by the 
Kolmogorov equation 

Pi = ^^P3 -Ph (6) 

j 

driven by the operator Aij/ kj — 5ij, which is related by similarity to the normalised Laplacian 
matrix. Here, we have adopted the notation k°^^ = ki as we will only focus on undirected 
networks from now on. Note that the stationary solution of this process is also p* = ki/2m. 
Following the above discussion, we define the continuous-time stability of a partition under such 
dynamics as the probability of a walker to be in the same community after a time t discounting 
the probability of such an event to take place by chance at stationarity: 

i?NL(t) = E E 

c ijec 

Here Bij = A^/kj and / is the identity matrix. It is possible to show that -Rnl(^) is a convex, 
non-increasing function and that it goes to zero for any partition when t ^ oo. This means that 
the memory of the initial condition is lost at infinity. 



kj k q 



t(B-I)\ _ '"J 

^-^ 2m 2m 2m 



(7) 
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The role played by time in this measure can be understood by considering the limiting 
behaviour of -RNL(i)- Let us first focus on the case t = 



where it is easy to see that _Rnl(0) is maximised when each of the nodes is in its own community, 
i.e., the optimal partition at t = is therefore the finest possible. On the other hand, in the 
limit t ^ oo, it can be shown via eigenvalue decomposition (see Supplementary Information) 
that -Rnl(^) is typically maximised by a partition into two communities in accordance with the 
normalized Fiedler eigenvector, a classic method in spectral clustering—. The optimisation of 
the stability -Rnl (t) therefore leads to a sequence of partitions where the number of communities 
typically decreases as time grows, from a partition of N one-node communities at t = to a 
two-way partition as t — oo. It is in this sense that time can be seen as an intrinsic resolution 
parameter in our measure: as time grows, the size of the communities is adjusted to reveal the 
possible hierarchical structure present in those networks for which finding just one partition is 
not satisfactory-^i^iSiM. 

This flexibility is also important to address the so-called "resolution limit" of modularity—, 
namely the fact that modularity optimisation may fail to identify the natural clustering of a 
network but instead give too coarse a partition (see Supplementary Information for a longer 
discussion). In our stability framework, partitions beyond the resolution limit are obtained for 
small time, as we approach i?NL(0), where the optimal partition is the finest possible. This 
t ^ limit has interesting properties. Firstly, the slope of R^hit) at the origin equals the cut 
fraction, i.e., the fraction of edges with extremities in different communities: —dR]<ii,/dt\t=Q = 
Cut = -Rnl(O) — Qconf • Hence partitions based on MinCut have the slowest decay of -Rnl(^) at 
the origin. Secondly, keeping linear terms in t in the short time expansion of -Rnl(^) leads to 



The approximate stability (5nl(^) is a convex combination of -Rnl(O) and the configuration 
modularity such that (5nl(1) = <5conf- In fact, (5nl(^) is equivalent up to a linear transforma- 
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RMt) ~ (1 - ^Nl(O) + t Qconf = QMt)- 
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tion to the tuneable Hamiltonian proposed by Reichardt and Bomholdt— as an ad hoc measure 
to obtain partitions beyond the resolution limit. Our analysis shows that this measure can be 
interpreted in terms of a simple linear approximation of the stability R^i,{t). However, it is 
important to remark that the stability R^Lit) provides a more complete picture of the underly- 
ing topology of the network at different scales, since it takes into account paths of any length 
between pairs of nodes. Such exploration of longer paths is essential if one is to detect groups 
of nodes characterised by longer connectivity patterns—. 



Other Markov processes 

The stability i?NL(^) emerges from the statistical properties of the normalised Laplacian dynam- 
ics (|24|) . which can be seen as a generic paradigm for the diffusion of some conserved quantity, 
e.g., information, on the network. However, many physical processes are instead governed by 
dynamics driven by the standard (combinatorial) Laplacian matrix: Aij — kidij. These include 
electrical networks— and other systems with flow conservation—, or linearisations of oscillator 
networks-i^, among many others. Such a continuous-time stochastic process is driven by the 
standard Laplacian dynamics: 

This process stems from the random walk (l23l) by assuming independent homogeneous Poisson 
processes at each node of the graph, but now with non-identical rates that are proportional to 
the strength. Hence, in contrast to process (l24l) . the probability to leave a node is proportional 
to its strengthens, and its stationary solution is uniform, p* = 1/N. The stability of a partition 
based on the dynamics of process (|25l ) is now given by 

C i,j£C 

where Kij = kj6ij. Again, keeping linear terms in t, one obtains an approximate stability valid 
in the limit t ^ 0: 

Rcdt) ^ (1 - t)i?CL(0) + tQunif = QclW, (12) 
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which is now directly related to the Erdos-Renyi (Bernoulli) version of modularity. Interest- 
ingly, Qchit) is exactly equivalent both to the multi -resolution Hamiltonian in^" (with t = I/7, 
7 being the resolution parameter in that method) and to the approach of Arenas et al.— based on 
the optimisation of Qunif for a modified network with r self-loops added (with t = (k) /{{k) +r)) 







The Laplacian processes (l24l) and (|25l) are different except for regular graphs with homo- 
geneous strength ki = (k), leading to different stabilities -Rnl(^) and Rchit) in general. This 
underscores the fact that there is no unique way to define the best partition of a network and 
that different quality functions will be more or less appropriate depending on the nature of the 
network and the dynamical processes underlying the system. For instance, the standard Lapla- 
cian (l25l) describes the (linearised) approach toward synchronisation of the Kuramoto model 
with identical intrinsic frequencies. Tracking the transients of linearised Kuramoto dynam- 
ics has been used to uncover hierarchies in networks'". Interestingly, it has been observed— 
that the partitions so uncovered optimise Qconf only for homogeneous networks; that is, when 
Qconf = Qimii- This is cxpcctcd from our analysis since Qunu is actually optimised by the 
dynamics. 

Our work indicates that different quality functions for partitions can be associated with 
linear, stable, conservative dynamics taking place on graphs through the consideration of the 
corresponding continuous-time stochastic processes. The stabilities -Rnl(^) and -Rcl(^) intro- 
duced above are just two archetypical examples (see Supplementary Information for other ex- 
amples) linked to standard forms of Laplacian dynamics concomitant with numerous physical 
and stochastic processes^- . Based on the nature of the network to be examined, this dynamical 
interpretation can aid in the definition of the most appropriate quality function to unfold the 
intrinsic sub-structure of the network at different scales. 



^In their original work. Arenas et al. focus on Qconf , but their approach can be generalized to other null models. 
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Methods 

Optimization 

Several community detection methods use modularity as an objective function to be op- 
timised in order to find the best partition of a network Although it has been shown that 
optimization of modularity is NP-complete^, several heuristic algorithms have been proposed 
to provide good approximations-. We will now show that it is always possible to rewrite the 
stability of a graph as the modularity of another symmetric graph. This observation has impor- 
tant implications, as it is therefore possible to use any algorithm, e.g. spectral or greedy, for the 
optimisation of modularity in order to optimise stability. In the case of a directed network, for 
example, where we have shown that stability R{1) fundamentally differs from modularity, one 



where the matrix Y = ^'^^ is mani 



can rewrite © as R{1) = J2cev ^ijec 

festly symmetric, and where Xij = -^TTj is the flow of probability from j to i at stationarity. 

i 

Because the inflow and outflow of probability at a node are equal at stationarity, network X is 
Eulerian, i.e., J2j ^ij = t^i = Ylj ^ji^ the strength of a node i in network Y is vr^. This 
implies that -R(l) is equal to the modularity of Y. 

This observation is not only valid at time t = 1 but can be generalized to any value of t. 
In the case of the continuous time process (l24l) . for instance, -Rnl(^) is equal to the modularity 
of a time-dependent weighted graph with adjacency matrix Xij{t) = (e*^"^"^^) kj, which is 
symmetric due to detailed balance at equilibrium. This interpretation provides the following 
intuition: by construction, the matrix X(t) explores larger and larger parts of the network while 
giving less and less weight to paths of length 1 (the links of the original matrix A). It is therefore 
expected that larger communities are found by optimising modularity of the weighted matrix 
X{t) as time is increased. The stability of a graph can therefore be optimized by first evaluating 
X and then by optimizing its modularity. In practice, the exponential of the transition matrix is 
evaluated via Pade approximation ^ and the optimization of modularity is performed by using a 
greedy algorithm- ^. Let us insist on the fact that partitions at different values of t are found in- 



^All the codes are available on \http://www. lambiotte. be\ 



10 



Lambiotte et al. 



dependently by following the described procedure. In the case of very large networks, for which 
such an optimisation or the evaluation of X is too onerous, it is instead possible to optimize the 
linearized stability Qj^i^)^ which can now be optimised at the same cost as modularity, i.e., 
0{N) for the fastest algorithms on sparse graphs. We have implemented two such algorithms 
based on a deterministic greedy heuristic— and stochastic simulated annealing—. 
Analyzing the sequence of partitions The optimization of stability over a period of time leads 
to a sequence of partitions that are optimal at different time scales. The extraction of these 
partitions is a first step in order to uncover the multi-scale modular structure of the network, but 
it still requires at least two non-trivial steps. 

On the one hand, one needs a way to select the most relevant scales of description, which 
is a well-known problem of multi -resolution methods. The significance of a particular scale is 
usually associated to a certain notion of the robustness of the optimal partition. Here, robustness 
indicates that a small modification of the optimization algorithm—, of the network—"^ or of the 
quality function^^ does not alter this partition. In this paper, we will follow the latter approach 
by defining the robustness of an optimal partition as its persistence over long periods of times. 
The persistence of a partition can be determined by looking for plateaux in the time evolution 
of summary statistics such as the number of communities. However, it is preferable and less 
ambiguous to actually compare the partitions and look for time intervals over which partitions 
are very similar A popular way to compare two partitions Vi and V2 is the normalized 
variation of information— 

logJy 

where H{Vi\V2) is the conditional entropy of the partition Vi given V2, namely the additional 
information needed to describe Vi once V2 is known. The conditional entropy is defined in 
a standard way for the joint distribution P(Ci, C2) that a node belongs to a community Ci of 
Vi and to a community C2 of V2- The normalized variation of information, which has been 
shown to be a true metric on the space of partitions, belongs to the interval [0, 1] and vanishes 
only when the two partitions are identical. Relevant time scales are identified as block diagonal 

11 



Lambiotte et al. 



regions where V{Vt, V^') is significantly small. 

On the other hand, one needs to check whether or not the sequence of partition is compatible 
with a hierarchical organisation. This problem requires the introduction of a quantity that mea- 
sures whether the communities at some time t' axe nested into the communities at a subsequent 
time t > t' . A well-known information theoretic measure is particularly adapted for such a 
purpose, namely the normalized conditional entropy 

which again belongs to the interval [0, 1], but is now an asymmetric quantity that vanishes only 
if each community of Vt is the union of communities of Vf' . The combined knowledge of V 
and H therefore allows us to uncover the significant partitions of the system and to verify if 
those partitions are organized in a hierarchical manner. 
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Supplementary Information 

Modularity for weighted or directed networks 

By definition, modularity tells us when there are more edges within communities than we would 
expect on the basis of chance. In general, this definition can be summarized by the formula 



Q = (fraction of edges within communities) 
— (expected fraction of such edges). 



(15) 



which leads to 



Q 



2m 



EE 



A.. — p.. 



if Pi, = (A;) 2/2, theng = g,„if 
if Pij = kikj/2m, then Q = Qconf- 



(16) 



in the case of undirected and unweighted networks. The null model Pij = kikj/2m is usually 
preferred because it captures the degree heterogeneity of the network. For general networks, 
the main difficulty consists in choosing a null model consistent with the network under consid- 
eration. In the case of undirected and weighted networks, modularity is usually defined as 



2m 



(17) 



Q=—yy 

cevi,jec"- 

where Aij is now the weight of a link between i and j, and ki = J2j ^ij 1^ strength of node 
i. For directed networks, it has been proposed to modify the null modified in order to account 
for the directionality of the links, thereby leading to the expression-^^ 

Q=—yy A. 

^ 2m ^ ^ 
where kf' = Y.^ and fc^* = E,- ^i*- 



2m 



(18) 
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Stability for directed networks. 

As we stressed in the main body of this article, stability Rm (t) and modularity Q differ in a 
fundamental way in the case of directed networks, and are in principle optimized by different 
partitions. While favours partitions with long persistent flows of probability within 

modules, (fTSi) favours partitions with high densities of links and is blind to the flow actually 
taking place on these links. In order to illustrate the difference, let us focus on the example 
taken from—. Optimising the modularity of this toy network leads to a partition where heavily 
weighted links are concentrated inside communities, as expected. Optimising stability, instead, 
leads to a partition where flows are trapped within modules. It is also interesting to stress 
that the partition optimising Rjm/(1) also optimises the map equation proposed by Rosvall and 
Bergstrom— . For an independent study of Rnwi^), we refer to the recent work of Kim et al— . 

Our definition of stability relies on the condition that the dynamics is ergodic. When the 
directed network is not ergodic, it is common to generalise the standard random walk (|23l) by 
incorporating random teleportations. If the walker is located on a node with at least one outlink, 
it follows one of those outlinks with probability 1 — r. Otherwise, the random walker teleports 
with a uniform probability to a random node. The corresponding transition matrix from j to i 
is given by 

(i-^)|l + ^((i-^K + ^)' (19) 

where aj is equal to 1 if j is a dangling node. This scheme is known to make the dynamics 
ergodic and to ensure the existence of one single stationary solution tTj that is an attractor of the 
dynamics. 

Resolution limit and size dependence. 

The resolution limit of modularity— is a well studied phenomenon that imposes a limit on the 
size of the smallest community one can obtain by modularity optimisation. It is a fundamental 
issue emerging from the definition of modularity. In particular, it originates from the factor 
l/2m in the null model Rij which implies that modularity depends on the size of the network 
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and not only on its local properties. It has been shown that the tuneable Hamiltonian of Re- 
ichardt and Bornholdt— , and hence our QNL{t), has a resolution limit for any given value of the 
resolution parameter 7—. However, the approximate quality measure QNL{t) has a remarkable 
property that implies that a change in the size of the system simply translates into a change in 
the time at which a partition is optimal. Let Vi be the optimal partition of a network A/i for 
some value of t, i.e., Vi optimises the quality function 



where nii is the total number of links in A/i. Let us now consider a new network Mtot made of 
the union of A/i and of another network A/2 such that there are no links between A/i and A/2. 
Let us now show that Vi still belongs to the optimal partition of Mtou but for another value of 
time. To do so, one should first note that the optimal partition of Q{t*) is made of connected 
communities, so that the optimal partition of Mtot is a union of partitions P* and P| of A/i and 
A/'2 respectively, i.e., all the nodes of a community either belong to A/l or to This implies 
that VI optimises the quantity 



construction. By comparing (l20l) and (|2T1) . it is now easy to show that Vi also maximises (|2T1) 
and therefore belongs to the optimal partition of Mtot for a value of time t* = t {nii/mtot)- 
Consequently, changing the size of the system only alters the resolution of the method. A 
similar relation can be obtained for QcL{t). 

Partitions at long times. 

In order to reveal the properties of RjyLit) in the limit t ^ 00, let us consider the spectrum 
of the matrix e^^"^^*. It is straightforward to show that its eigenvalues are real and positive. 
The largest eigenvalue Ai is equal to 1 and its associated eigenvector f = ki/2m corresponds 




(20) 




(21) 



where nitot is now the total number of links in J\ftot and where Aij is the same as in ( |20l ) by 
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to the stationary solution of the random process. All the other eigenvalues are associated with 
relaxations toward stationarity and have the form \k = e'""^^, with > and therefore vanish 
when t ^ oo. If we denote by the k}'^ largest eigenvalue and by f p^ its corresponding 



eigenvector, one can then rewrite RNiit) as 

C i,j 



2m 



(22) 



where the elements of the vectors are 1 if i G C and otherwise. The vector may be 
decomposed in the basis of eigenvectors Vj. The contribution of the first eigenvector ij is shown 
to vanish for all times and for any partition. In the limit t ^ oo, the main contribution comes 
from Vj and RNL{t) is shown to be optimised by the two-way Fiedler-type partition, deduced 
from the sign of entries of the second eigenvector of the normalized Laplacian— . The optimal 
partitions of RcL{t) when t ^ oo are obtained in the same way except that they are now based 
on the second eigenvector of the combinatorial Laplacian, as originally proposed by Fiedler' . 
Hence, at long times, RNiit) and Rciit) recover the two classical spectral algorithms based 
on the two Laplacians. 

More general random walks 

The continuous-time random walks described in this paper are based on the same discrete-time 
process 



Pi;n+1 = y2 ^Pr,n. (^3) 
^3 



3 

where a walker follows a links with a probability proportional to its weight, and only differ in 
the statistics of the waiting times between two jumps. 

P^ = Y.T^P■J-P^ (24) 

and 
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are therefore two particular cases of the continuous time process 



= 5Z 4^^(^j)Pj ~ '^{kt)Pi, (26) 



3 



where T{ki) is the rate at which random walkers leave a node of degree ki and whose stationary 
solution is = ki/T{ki). 
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Figure Legends 

Figure 1: Stability of the partition of a network. Given a network and a partition of the 
network (into two communities indicated by the red and green colors), one can evaluate the 
quality of the partition from the statistical properties of coarse-grained trajectories of a random 
walker. A partition will be of high quality at a given time- scale if the walker has high probability 
of remaining within the specified communities within that time-scale (left). If the partition is 
of low quality (right), the walker will switch at a high rate between communities. The stability 
of the partition R{t), defined in ([7]), can be interpreted as the time auto-correlation of such a 
coarse-grained signal, where a numerical value is assigned to each colour. This measure gives 
information about the average time spent by the walker inside the specified communities and 
establishes a time-dependent measure of the quality of a given partition. 

Figure 2: Sequence of optimal partitions of a hierarchical network as a function of time. 

We consider a regular hierarchical graph generated as follows^. Start with a pair of nodes 
connected by a link of weight c < 1, duplicate them and add a link of weight between all 
pairs of nodes in different modules. This meta-module of four nodes is duplicated and links 
of weight are added between nodes of different meta-modules. A fully-connected, weighted 
network of 2^ nodes is obtained by iterating this step K times. In the right upper comer, we 
represent such a network with 2^ = 16 nodes and edges shaded according to their strength 
(c = 1/4). By symmetry, the natural partitions are into 16 single nodes, 8 pairs of nodes 
(colours), 4 groups of 4 nodes (shapes) and 2 groups of 8 nodes (upper and lower hemispheres). 
The figure also shows the stability R{t) of the natural partitions of this graph. As t grows, the 
sequence of optimal partitions goes from 16 communities to 8 to 4 to 2. 

Figure 3: Sequences of optimal partitions of the karate network. We optimize the stabili- 
ties RNiit) and RcL{t) of the karate network—, which is a small social network made of 34 
nodes, (a) Normalised variation of information V{Vt,V^i) between the optimal partitions of 
i?7VL at different times t and i! . We also plot the number of communities of the correspond- 
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ing partitions. It is interesting to note that a constant number of communities does not imply 
that the partitions are equivalent, and that variations in the number of communities may have a 
marginal effect on the distance between the partitions. The most persistent partitions are high- 
lighted by dashed lines, (b) The small values of the normalized conditional entropy H{Vt, V^') 
indicate that partitions at different times are nested in each other, (d) Normalised variation of 
information V{Vt,V^i) for Rcl- A comparison of (a) and (d) shows that RNL{t) and Rciit) 
are optimized by distinct sequences of persistent partitions. This is confirmed by looking in 
(c) at V{Vt, V^i) where Vt now denote optimal partitions of R^L^t) and Vf' optimal partitions 
of RcL{t')- The differences between both measures emanate from the underlying dynamical 
processes with distinct stationary distributions that link RNL{t) to the configuration null model 
and RcL{t) to the uniform null model. 

Figure 4: Sequences of optimal partitions of a benchmark network. The benchmark net- 
work is made of 640 nodes with 3 hierarchical levels (modules of 10, 40 and 160 nodes re- 
spectively)-. The density of links within the modules at different levels is tuned by a single 
parameter p, p = 1.5 in this example. In (a) and (b), we plot the normalised variation of in- 
formation V(Vt, V^' ) and normalized conditional entropy HiVt, V^' ) for the optimal partitions 
of RNL{t) and RNL{t'). In (a), we also plot the number of communities of the corresponding 
partitions and, in dashed lines, the number of communities in the natural partitions. The optimal 
partitions are all evaluated for the same realization of the random graph. Because the network 
is homogeneous (and therefore almost regular), we do not focus on Rciit)- 

Figure SI: Modularity vs stability. In this toy network proposed by Rosvall and Bergstrom— , 
the weight of the bold links is twice the weight of the normal links. The partition on the left is 
shown to optimise Rrw{^)- The partition on the right instead optimises modularity. 



24 



Lambiotte et al. 







Figure 1. 
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Q=0.25 
Rrw(I)=0.42 



Q=0.5 
Rrw(I)=0.33 



Figure SI. 
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